Maximum size of reverse-free sets of permutations 

Josef Cibulka 1 



1 Department of Applied Mathematics, 
Charles University, Faculty of Mathematics and Physics, 
Malostranske nam. 25, 118 00 Praha 1, Czech Republic; 
cibulka@kam.mff . cuni . cz 

Abstract 

Two words have a reverse if they have the same pair of distinct letters on the 
same pair of positions in reversed order. A set of words no two of which have a 
reverse is said to be reverse-free. Let F(n, k) be the maximum size of a reverse- 
free set of words from [n] k where no letter repeats within a word. We show the 
following lower and upper bounds in the case n > k: F(n, k) € n fc £;~ fc /2+0(fc/iogfc)_ 
As a consequence of the lower bound, a set of n-permutations each two having a 
reverse has size at most n n / 2 +0(n/\ogn) _ 



1 Introduction 

Let [n] be the set of integers between 1 and n. A word w of length k over the alphabet 
A is a sequence w±, . . . , Wf. of elements from A. The set of all words of length k over [n] 
is [n] k . A word is repetition-free if it contains at most one occurrence of each symbol. 
The set of all repetition- free words of length k over [n] is [rqVfc). Notice that when n = k, 
the set [n]( n ) is the set S n of permutations on n elements . A code J 7 of length k is a 
subset of [n] h . The size of J 7 is the number of words in J 7 . Codes are usually defined 
to be sets of words that in some sense significantly differ from each other in order to 
be distinguishable when transmitted over a noisy channel. We study reverse-free codes 
introduced by Fiiredi, Kantor, Monti and Sinaimeri [?]. Two words w and x have a 
reverse if for some pair of positions, we have Wi ^ Wj, Wi = Xj and Wj = Xi. If 
w and x do not have a reverse, they are reverse-free. A code is reverse-free if its words 
are pairwise reverse-free. Let F(n, k) be the size of the largest reverse-free code over [n] 
of length k. Let F(n, k) be the size of the largest reverse-free code over [n] of length k 
containing only repetition-free words. Let 
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The limit exists for every k > 1 [7J. We will use the following equivalent definitions of 
the limit: 

m = Um = lim 

The first equality follows from the fact that lim n _ s>00 (2)k\n~ k = 1. The second equality 
is a consequence of the observation that for every fixed k, we have F(n, k) < F(n, k) < 
F(n, k) + 0(n k ~ l ) [TJ. The only exact values of the limit known are f(l) = 1, f(2) = 1/2 
and /(3) = 5/24 [7J. 

We tighten the bounds on the maximum size of reverse-free codes of length greater 
or equal to the size of the alphabet. 

Theorem 1.1. For every n > k, we have 



n 



k k -k/2-o(k/io S k) < F ( njA .) <F(n,k) < n " k -k/2+o(k/^k) t 



The first inequality is proven in Section [2] as Corollary 12.51 and the last inequality is 
proven as Claim in Section [3J As an immediate consequence, we obtain the following 
bounds for permutation codes: 

n n/2-0(n/lo g n) < F fa n ) < J(n, n) < „«/ 2 +° W lo g ") . 

and for the limit for codes of fixed length k: 

f(k) G k~ k/2+ °( k/logk \ 

A set of words is full of flips if each two words from the set have a reverse. Let 
G(n, k) be the size of the largest code full of flips with elements in [n] k . Let G(n, k) be 
the size of the largest code full of flips with elements in [n](k)- By G(n, n)F(n, n) < n\ [7J, 
we obtain the following corollary. 

Corollary 1.2. The size of a set of permutations full of flips is at most 

G(n,n) < n «/2+0(n/logn)_ 

A position of an entry of a matrix is represented by a pair (r, c) of the row number 
r and the column number c. A {0, l}-matrix is a matrix whose each entry is either or 
1. Every matrix in this paper is a {0, l}-matrix, even if not explicitly mentioned. 

All logarithms in this paper are of base 2. 



2 Lower Bound 

A submatrix of a matrix B is a matrix that can be obtained from B by the removal of 
some columns and rows. An mxn {0, l}-matrix B contains a k xl {0, l}-matrix S if B 
has a k x / submatrix T that can be obtained from S by changing some (possibly none) 
0-entries to 1-entries. Otherwise B avoids S. 

Fiiredi and Hajnal [S] studied the following problems from the extremal theory of 
{0, l}-matrices. Given a matrix 5* (the forbidden matrix), what is the maximum number 
of 1-entries in an n x n matrix that avoids S7 
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We restrict our attention on forbidding the 2x2 matrix with each entry equal to 1 
and we call this matrix S. This question is closely related to the extremal problem of 
determining the maximum number of edges in an n-vertex graph without a 4-cycle as a 
subgraph. The maximum number of edges in a 4-cycle-free graph is known precisely for 
infinitely many values of n [5] . We will use a classical construction of a bipartite 4-cycle- 
free graph (see for example the book of Matousek and Nesetfil [8]). We reproduce the 
construction here in the matrix setting since we need some of its additional properties. 

The construction of a matrix avoiding S builds the matrix using a finite projective 
plane. Let X be a finite set and let £ be a family of subsets of X. The set system (X, C) 
is a finite projective plane if 

(PO) There is a 4-tuple F of elements of X such that \F D L\ < 2 for every L G C. 
(PI) For every Li, L 2 G £, \L X n L 2 \ = 1. 

(P2) For every x, y G X there exists exactly one L G L containing both x and y. 

For every finite projective plane, we can find a number r, called the order of the 
projective plane, satisfying: 

(P3) For every L G C, \L\ — r + 1. 

(P4) Every x G X is contained in exactly r + 1 sets from C. 

(P5) We have \X\ = \C\ = r 2 + r + 1. This value is the size of the projective plane. 

It is known that for every number r that is a power of a prime number, we can find 
a finite projective plane of order r [8]. 

Claim 2.1. If n is of the form r 2 + r + 1, where r is a power of a prime, then 

F{n,n) > ^/2-o(«/iogn)_ 

Proof. We fix a projective plane (X, C) of size n. We order the elements of X and the 
sets of £ arbitrarily. The incidence matrix of a finite projective plane of size n is the 
n x n matrix A with 1 on position (i, j) exactly if the i-th set of C contains the j-th 
element of X. Let A be the incidence matrix of (X, £). 

An n-permutation matrix is an n x n matrix with exactly one 1-entry in every column 
and every row. An n-permutation is a permutation on n elements. The set of n- 
permutations and the set of n-permutation matrices are in the following bijection. The 
permutation it is matched with the matrix P with 1 on position (i, j) exactly if 7Tj = j. 
Let V be the set of n-permutation matrices contained in A and let II be the set of 
n-permutations matched to the matrices from V. By (P3) and (P4), A has exactly r + 1 
l's in every row and every column. Thus by the van der Waerden conjecture proved 
independently by Falikman [4] and Egorychev [5], 

|n| = \v\ > (^)^! > (^) n > (^r^f ^ ^ /2 -° (n/logn) . 

We claim that the set II is pairwise reverse-free. For contradiction, we take 7r G LT 
and p G LT with a reverse on positions i and j. That is, 7Tj = k, ttj — I, pi — I, pj = k 
for some k and /. Since P^ and P p are contained in A, this implies that A contains the 
matrix S on rows i,j and columns k and Z; a contradiction with (PI). □ 
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By the prime number theorem, the gaps between two consecutive prime numbers in 
proportion to the primes tend to zero. There has been a significant progress in tightening 
the gap between two consecutive primes. Most recent is the following result of Baker, 
Harman and Pintz pQ. 

Theorem 2.2 (Baker, Harman, Pintz, 2001). For every large enough n, the interval 
[n — n 0,525 , n] contains a prime number. 

Lemma 2.3. For every n, 

F(n,n) > n "/2-0(n/logn)_ 

Proof. For an arbitrary n we take the largest n' smaller than n and expressible as p 2 +p+l 
for some prime p. We have 

p > n l/2 _ 1 _ n 0-525/2 and go 

ri >n-0{n 1 -™' 2 ). 

We take the set II' of (ji'^ n / 2 ~ i n '/ lo ^ n ) pairwise reverse-free n'-permutations from 
Claim 12.11 We append the sequence (n' + 1, n' + 2, . . . , n) to the end of each ir' G IT. 
Let the resulting set of n-permutations be II. The set II of permutations is pairwise 
reverse-free and has size at least n n / 2 -°( n / l °g n ) . □ 



Lemma 2.4. For every n > k, 



n 



k 

F(n,k)> - F(k,k). 
LfcJ 

Proof. Let II be a reverse- free set of /c-permutations of size F(k,k). Given a word 
u = (ui, «2, • • • , Uk) G N(fc), w e call the word {u\ mod k, u 2 mod k, . . . ,Uk mod k) the 
compression of u. Let J 7 be a set of all the words in [n]i^ whose compression is in II. The 

size of T is at least I rj ^ | TT | . It remains to show that T is reverse-free. For contradiction, 
assume that some pair of words (ui, . . . , «&) and (v\, . . . , Vk) has a reverse on the pair 
of positions. That is, = Vj and Uj = i>j and, in particular, Ui mod k = Vj mod k 
and Uj mod k = Vi mod k. Because the compression of u is a permutation, mod k ^ 
Uj mod k. This is a contradiction, because the compressions of u and v are in the 
reverse-free set IT. □ 



Corollary 2.5. For every n > k, 

F(n,k)>n k k- k ^-°^^ k \ 



Proof. Since n> k, we have \n/k\ > n/(2k). Therefore, by Lemmas 12.31 and 12.41 
F(n, k) > (^-) k F(k,k) > ^—k k / 2 ~°W l °^ > n k k -k/2~o(k/io gk )^ 



□ 
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3 Upper Bound 



We adapt the classical proof of the 0(n 3 / 2 ) upper bound on the number of edges in a 
4-cycle-free graph to show that a matrix with many 1-entries contains many occurrences 
of the matrix S — (}}). That is, there are many pairs of rows and pairs of columns of 
A having l's in their four intersections. 

Lemma 3.1. Let k and n be integers such that n > k > 1 and let m be a real number 
from the closed interval [1,/c 1 / 2 ]. Let A be a k x n {0, l}-matrix with at least rank 1 / 2 
1-entries. The number of occurrences of S in A is at least 

*v a 4 -i) a 

Proof If k = 1, then A has at most n l's and so m = 1 and the claim is trivially 
satisfied. So we assume that k > 2. We also assume that A has no empty rows. If A 
has empty rows, we remove them and use the claim for the matrix with no empty rows. 
Since the removal decreases k, keeps n the same and increases m, we obtain the desired 
estimate. 

We first count the number q of pairs of l's that are in the same row. Let di be the 
number of l's in the i-th row of A. We have 

f d 'i\ ^ ( d i - l f 

1= E {2)^—2— 

{i:d x >2} V 7 i=l 

Let d be the average number of l's in a row, that is, 

k k 

By the convexity of the function f(x) — (x — l) 2 /2, we have 

q > k^-^ > (mnk-^ - I) 2 > {mU -/ /2)2 >^f- 0(mnk^). 

Let r^j be the number of rows that have a 1-entry in columns % and j. Let 1Z be the 
set of pairs {i,j} of column indices satisfying 1 < % < j < n and r^j > 2. 

First, we consider the case q < n 2 /2. From the estimate q > (run — k 1 ^ 2 ) 2 /2, we 
obtain m < 1 + k x l 2 jn. Therefore n 2 (m 2 — l) 2 e and the result holds trivially, 

because k < n. 

Now, we assume q > n 2 /2, which implies \7Z\ > 0. By double counting, 



q= r ^ ^ E r ^'+ ( 9 ) ~ 1^1- 

i<i<j<n {i,i}e7e 



Let 



def E{i,j}gft r »J > g ~ (0 ~ 1^1) = g ~ (2) , 1 
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Let s be the number of occurrences of S in A, that is, s = 
convexity of f(x) = (x — l) 2 /2 and since r > 1, we have 



( r «). By the 



(m 2 n 2 /2 - 0{mnk 1 ' 2 ) - n 2 /2) 2 




> 



> 



2\n\ 

{n 2 {m 2 - l) - Oimnk 1 ' 2 )) 2 



An 2 



> 



2(m ' 1)2 -0(m 3 nk^ 2 ). 
4 v ' 



□ 



We first give some definitions and outline the proof of the upper bound in Theo- 
rem 11.11 without mentioning precise values used. We use a modification of a method of 
Raz [S] , that was used for proving upper bounds in another extremal problem on sets of 
permutations [HIE]- 

A k x n word matrix is a k x n matrix with exactly one 1-entry in every row. A 
k x n word u is a sequence u±,U2, ■ ■ ■ , Ufe of k letters from the alphabet [n\. The set of 
k x n words and the set of k x n word matrices are in the following bijection. The word 
u is matched with the matrix U having 1 on position (i,j) exactly if itj = j. A set IA of 
k x n word matrices is reverse-free if the set of corresponding words is reverse-free. 

Given a set IA of k x n word matrices, we let the overall matrix An be the k x n 
matrix having 1-entries on those positions where at least one matrix of IA has a 1-entry. 
The basic idea is to design a procedure that shrinks the set U in order to decrease the 
number of l's in the overall matrix. When the overall matrix has few l's, we use a 
trivial estimate on the size of what remained in U. By analyzing the procedure, we then 
deduce that the original size of U was small. 

The shrinking procedure uses the result of Lemma 13.11 applied on the overall matrix. 
Assume that the overall matrix contains S on the intersection of rows r\ and r2 and 
columns C\ and C2- Let an avoided pair be a pair of 1-entries of the overall matrix that do 
not appear together in any matrix in U. By the reverse-free property of U, we know that 
at least one of the two pairs {(ri,ci), (r 2 ,C2)} and {(ri,C2), (r 2 ,ci)} is avoided. When 
the overall matrix contains many occurrences of S, we find a 1-entry (r, c) occurring in 
many avoided pairs. If the 1-entry (r, c) is not present in enough matrices from U, we 
remove from IA all the matrices containing (r, c), thus removing (r, c) from the overall 
matrix. Otherwise, we keep only the matrices that contain (r, c), thus removing all the 
matrices containing any of the 1-entries that appear in some avoided pair together with 



Given a reverse-free set IA of k x n word matrices, let Ay be the overall matrix of IA. 
Let the weight \An\ of the overall matrix be the number of its 1-entries. The density of 
the overall matrix is rriu = \Au\j (nk 1 ^ 2 ). The 1-entry of the overall matrix An on the 
position (r, c) is light if the number of matrices U alA having 1 on position (r, c) is at 
most \U\/n. Let the emptiness zu oilA be the number of rows of Au with at most one 
1-entry. 



(r, c). 
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Observation 3.2. LetU be a reverse-free set of k x n word matrices such that Au has 
a light 1- entry. Then the setW of word matrices ofU not containing the light 1- entry 
satisfies 

\U'\> (l-i) \u\, 

\Au'\ < \Au\ ~ 1 an d 
zw > z u - 

Let n be a constant such that for every n > n , k < n and m > 5, every matrix 
A with mnk x l 2 l's contains n 2 m 4 /3 occurrences of 5. The existence of n follows from 
Lemma 13.11 

Claim 3.3. Let n > uq and let k < n. LetU be a reverse-free set of kxn word matrices 
such that Au has no light 1- entry and with density mu > 5. Then there exists a set 
W C U satisfying 

iw'l > M, 

n 

\Au>\ < and 
zw > z u + 1. 

Proof. The overall matrix Au contains n 2 m u /3 occurrences of S. So at least n 2 m u /3 
pairs of 1-entries of Ay are avoided. Thus there is a 1-entry of An such that the number 
of avoided pairs containing this 1-entry is at least 

2n 2 mf 4 2n 2 m^ l 2nmf /l 
3\A U \ = 3nk^ 2 m u = 3k 1 / 2 ' 

Let (r, c) be the position of this 1-entry. Let W be the set of those matrices from U 
that have 1 at position (r, c). We consider a position (r',c') such that {(r, c), (r',c')} is 
an avoided pair. Every matrix U' G W has at position (r',c'). So also Aw has at 
position (r',c'). Therefore \Aw\ < \Au\ — 2nm u / [3k 1 / 2 ). Because (r, c) is not a light 
1-entry, |W| > \U\/n. Since W contains only word matrices, the matrix Aw contains 
only one 1-entry in row r. On the other hand, the 1-entry at position (r, c) is contained 
in at least one occurrence of S in An, so Ajj contains more than one 1-entry in row r. 
Thus Zw > zu + 1. □ 

Claim 3.4. LetU be a set of n x k word matrices, where n > k. IfU is reverse-free, 
then 

\U\ < n k k -k/2+0(k/lo g k) 

Proof. We first consider the case that the density mu of the overall matrix is smaller 
than 10. Since the number of k x n word matrices contained in Ay is maximized when 
each of its rows has the same number of 1-entries, we obtain 



\U\ < (lOnk 



-l/2\fc 
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and the result holds. 

Otherwise, we apply the following procedure on U. We proceed in several steps. Let 
Hi dU be the set of word matrices before the step i. Let IA\ = hi. If the overall matrix 
at the beginning of the step % has a light 1-entry, we obtain from Hi by applying 

Observation 13. 2\ otherwise, by applying Claim 1331 Let m ; = m u . and = A Ui . Light 
steps are the steps when Observation 13.21 is applied and heavy steps are the remaining 
ones. The steps are further grouped into phases. Phase 1 starts with step p\ — 1. 
For every j > 2, phase j starts with step pj chosen as the smallest index such that 
< m p . _ 1 /2. The last phase is the first phase £ that decreases the density of the 



rn 



i-'j 



overall matrix below 10. So at the beginning of the last phase, we have 

m pi > 10. 



Because each light step decreases the number of l's in the overall matrix by 1, only 
at most nk light steps are done during the whole procedure. 

It remains to count the heavy steps. At the beginning of a heavy step i of phase j, 
we have 

mi > m p ./2. 



By Claim 13.31 the heavy step decreases the number of 1-entries in the overall matrix by 

,3 



\AA L4 



i+l 



2nm? 
> A- > 



nm 



pj 



3k 1 / 2 '- 12k 1 / 2 ' 

Since the phase j ends at the moment when at least \A Pj \/2 1-entries are removed, the 
number of heavy steps of phase j is at most 



nk 1 / 2 m Pj /2 




6k 


nm*./ (12k 1 / 2 ) 







Each phase shrinks the weight of the overall matrix by a factor of at least 2, so m Pj > 
m Vl 2 e '~^ for every j. We also have for every j G {1, . . . , £} 

10 < m Pj < k 1/2 . 

Let t be the total number of heavy steps. We have 



6A; 



rn 



Pi 



3=1 



Ik 



m Pl 2^) 2 



Ik ~ 

^ —rY T 

m z L — 4 



2j 



< 



3=0 



7k A k 
mj e 3 -TO" 



Let W be the set of word matrices after phase £. During the whole procedure, at 
most nk light steps and t < k/10 heavy steps were made. We have 

nk 



\U'\ > \U\ 



l-I 

n 



n 



-t 



(3.1) 



The overall matrix Ayi has at most IQnk 1 / 2 1-entries and at least t rows with a single 
1-entry. The number of k x n word matrices contained in Ayi is maximized when each 
of its rows with at least 2 1-entries has the same number of 1-entries. Thus, 



\U'\ < 



lOnk 1 / 2 
k-t 



k-t 



< rf 



k-t 



12 

W 2 



since t < k/10. 



(3.2) 
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By combining Equations (13. ip and ()3.2|) . we conclude that 

|W| < n fc -*(12ife- 1 / 2 )*e 2fc n < < ^-WW^). 

□ 
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